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Abstract 

(N 

^-H Performance bounds for opportunistie networks have been derived in a number of recent 

^^ papers for several key quantities, such as the expected dehvery time of a unicast message, 

CN or the flooding time (a measure of how fast information spreads). However, to the best of 

f^ our knowledge, none of the existing results is derived under a mobility model which is able 

^ to reproduce the power law+exponential tail dichotomy of the pairwise node inter-contact 

' time distribution which has been observed in traces of several real opportunistic networks. 

r~~; The contributions of this paper are two-fold: first, we present a simple pairwise contact 

model - called the Home-MEG model - for opportunistic networks based on the observation 
made in previous work that pairs of nodes in the network tend to meet in very few, selected 
locations (home locations) ; this contact model is shown to be able to faithfully reproduce the 
power law -(-exponential tail dichotomy of inter-contact time. Second, we use the Home-MEG 
model to analyze flooding time in opportunistic networks, presenting asymptotic bounds on 
flooding time that assume different initial conditions for the existence of opportunistic links. 
Finally, our bounds provide some analytical evidences that the speed of information 
VJ spreading in opportunistic networks can be much faster than that predicted by simple geo- 

metric mobility models. 

(N 

vo 1 Introduction 

"^ . 

<^ Opportunistic networks are a special class of mobile ad hoc networks where node density is so 

^~~^ low that the network is disconnected at virtually any time. Communication is possible even in 

• • such a challenging environment by exploiting the so-called store- carry- and-forward mechanism, 

.J^ according to which a packet is stored in node ^'s buffer and carried around by the node till 

/\ a communication opportunity with another node B arises (whence the name of this class of 

^ networks); at this point, the packet can be forwarded from A to B, and the process is repeated 

until the packet is eventually delivered to the destination. 

Opportunistic networks are receiving increasing attention in the research community, since 
many emerging application scenarios can be considered as instances of opportunistic networks. 
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This is the case, for instance, of vehicular networks (at least, when traffic density is medium to 
low), of certain types of mobile sensor networks, and of pocket switched networks. The latter 
type of network is formed by powerful handheld devices - able to establish direct wireless com- 
munication links through, e.g., a WiFi interface - carried around by humans in their everyday 
hfe. 

A crucial aspect when attempting to analyze opportunistic network performance is mobility 
modeling. In fact, node mobility is a crucial communication means in opportunistic networks, 
and different assumptions about node mobility patterns might lead to different conclusions 
about network performance. Thus, credible conclusions about network performance for what 
concerns, e.g., flooding time, can be drawn only if the mobility model used in the analysis 
closely resembles mobility features observed in real-world mobility traces that are available in 
the literature - see, e.g., those collected in [6]. In particular, the mobility feature that most 
profoundly impacts opportunistic network performance is the inter- contact time, defined as the 
time elapsing between two consecutive encounters of any two nodes A and B (see Section ^ for 
a formal definition). 

It has recently been observed that the inter-contact time distribution in real-world mobility 
traces displays a dichotomy: it initially obeys a power law, but it has an exponential tail 
for relatively large time values [H]. This so-called power law-|-exponential tail dichotomy of 
inter-contact time distribution is now commonly accepted in the community as a distinguishing 
feature of human mobility. Another interesting finding of [H] is that the inter-contact time 
between a pair of nodes in the network is positively correlated with the return time to some 
"home locations" . In other words, a specific pair of nodes tends to repeatedly and quite regularly 
meet in a few selected locations within the movement area, which explains the exponential decay 
in the tail of the inter-contact time distribution. 

Recently, several mobility models aimed at reproducing the power law+exponential tail 
dichotomy of inter-contact time have been introduced in the literature [U [HI [l2l [151 [El 120] , 
most of which are based on the notion of home location first introduced in [T55. However, to our 
best knowledge, existing results on asymptotic performance bounds for opportunistic networks 
are based on much simpler models, which are not able to reproduce the power law+exponential 
tail dichotomy of inter-contact time observed in [Tl] - see next section. Thus, the problem of 
analyzing opportunistic network performance based on a mobility model able to reproduce this 
dichotomy is still open. 

In this paper, for the first time, we present a theoretical analysis of opportunistic network 
performance based on a pair-wise contact model (which can be thought of as an abstract mobil- 
ity model) able to reproduce the power law+exponential tail dichotomy of inter-contact time, 
thus addressing the above mentioned open problem. More specifically, we first introduce a 
simple, Markovian model of communication opportunities between pair of nodes, based on the 
observation made in [14] that nodes tend to meet at few selected locations (homes), which 
explains the name of the model - Home-Mar kovian Evolving Graph (Home-MEG, in short). 
Next, we validate the Home-MEG model against several real- world data traces, and show that 
it is able to closely reproduce the power law+exponential tail dichotomy of inter-contact times. 
Finally, we use the Home-MEG model to derive theoretical performance bounds on hooding 
time in opportunistic networks as the size n of the network grows. As we shall see, our bounds 
suggest a radically different performance than that predicted by previous studies on hooding 
time based on geometric, random walk- like mobility models [H [5l [211 122]: flooding might be 
very fast in opportunistic networks having dichotomic inter-contact time. 

Before ending this section, we want to comment about the importance of characterizing 
flooding performance in the asymptotic regime. The flooding time (or broadcast time) is deflned 
as the time necessary for a node to deliver a packet to every other node in the network. Accurate 



estimation of the flooding time can help researchers and network designers answer questions 
such as: "Which is the time needed for a warning message issued by a vehicle to reach every 
other vehicle in the network?", or "Which is the time needed for a virus generated by a node 
in a pocket switched network to propagate in the entire network?" , and so on. Our interest in 
asymptotic analysis stems from the fact that the number of nodes in opportunistic networks 
is likely to be very high: think about the number of vehicles forming a vehicular network in a 
large city, or the number of individuals forming a metropolitan- wide pocket switched network. 

2 Related works and contributions 

Several mobility models have been recently introduced in the literature with the purpose of 
faithfully reproducing features observed in human mobility traces [U [HI [l2l [151 [181 [20] . I^i feet, 
it has been observed that well-known mobility models such as random-waypoint and random 
walks cannot be directly used to faithfully reproduce human mobility, especially for what con- 
cerns the observed power law+exponential tail dichotomy of inter-contact time distribution [14J. 
However, these models can be modified to (explicitly or implicitly) account for some form of 
social relationships between individuals, so that features observed in real-world contact traces 
emerge also in the traces generated by the mobility model at hand. Unfortunately, all the 
recently introduced models of human mobility share the property of introducing a strong inter- 
dependency between movement patterns of nodes in the network (e.g., nodes with strong social 
ties tend to have quite similar mobility patterns), which renders the analysis of asymptotic 
opportunistic network performance based on these models extremely difficult. 

As a matter of fact, existing opportunistic network performance analyses are based on much 
simpler mobility models, if not on more abstract models of pair-wise contacts in which node 
trajectories in a geographical domain are not modeled, but only possible pair-wise contact 
opportunities. For instance, in [9l [23l [25l [27] the authors study unicast performance under the 
assumption of i.i.d., exponentially distributed inter-contact times between any pair of nodes in 
the network. Similarly, in [2] the authors analyze unicast performance under the assumption of 
inter-contact times distributed according to a power law, while the analysis of [10] is based on the 
assumption of a truncated power law to model inter-contact times. Simple geometric, random- 
walk mobility models have been instead used to analyze flooding performance in opportunistic 
networks - see [i [S, [HI [ZH [22] . 

Summarizing, we currently are in a situation in which there is a quite deep understanding 
of the main features of human mobility, and a good number of mobility models aimed at repro- 
ducing these features have been designed. On the other hand, existing analytical performance 
bounds for opportunistic networks have been derived based on much simpler mobility models, 
which are known not to be able to reproduce typical human mobility patterns. Thus, currently 
there is a gap between the state of knowledge in human mobility modeling, and what is known 
about opportunistic network performance bounds. 

In this paper, we make a substantial step forward towards closing this gap by presenting, for 
the first time, asymptotic performance bounds for opportunistic networks based on a contact 
model which we show to be able to reproduce a salient feature of human mobility, namely, that 
pair-wise inter-contact times display a power law+exponential tail dichotomy. More specifically, 
we present bounds on flooding time, which were previously derived only using simple geometric, 
random- walk mobility models which are known not to be able to reproduce the above described 
dichotomjQ 



^Although the authors of fT3' introduce versions of random waypoint and random walk mobility able to 
reproduce the power law+exponential tail dichotomy, the mobility models used in the analyses of [4l[5l ll3ll21U22) 
are based on standard random waypoint/random walk models, which do not reproduce the inter-contact time 



Before ending this section, we want to outline that different mobility assumptions have a 
profound impact on the derived asymptotic performance bounds. For instance, considering 
unicast performance, denoting with E[Tn] the expected packet delivery time in a network with 
n nodes, we have that limn^oo E[Tn] = c, for some constant c > 0, in case of exponentially 
distributed inter-contact times [9l[25l[27], while linin^oo E[Tn] = +00 if inter-contact times 
obey a power law with sufficiently low exponent f^. 

The bounds derived in this paper seem to suggest a similar impact of mobility assumptions 
on flooding performance. In fact, if we consider n nodes moving with random walk- like mobility 
over a square area of side L = Q{y/n) with transmission range and node speed set to some 
positive constant, then the time needed to complete flooding is Q{^/n), a lower bound which 
has been recently shown to be almost tight |2HI22j. Thus, existing results suggest that flooding 
time grows unboundedly with n, at rate which is polynomial in n. On the contrary, our results 
suggest that, in a network with similar density as that considered in [2^122]. flooding time grows 
unboundedly with n, but with a rate which is logarithmic in n - for a more detailed discussion 
on this important issue, see Section |4] 

3 The Home-MEG model: definitions and validation 

3.1 The Home-MEG model for a single link 

When modeling asymptotic opportunistic network performance, at least the following two op- 
tions can be undertaken to model inter-contact times: i) using a geometric mobility model 
defined in a specific geographic domain (e.g., random- walk in a unit square), and deducing 
pair- wise contact patterns based on node trajectories, coupled with a notion of transmission 
range - see, e.g., [H [5l [13l [211 [22]; or ii) abstracting details of the underlying geographic do- 
main and modelling directly inter-contact time between nodes - see, e.g., [21 [9l [THl [23l [25[[27]. 
Both approaches have pros and cons. In the case of i), pair- wise contact traces are by construc- 
tion consistent with a reference geometry and, under that respect, can be considered realistic. 
On the other hand, the analysis of asymptotic performance bounds under approach i) is quite 
complex, due to the need of deriving contact traces from an underlying mobility model. As a 
matter of fact, this intrinsic complexity is the reason why existing analyses based on approach i) 
are restricted to very simple underlying mobility models such as random waypoint and random 
walk, which are known not to be able to reproduce the power law -|- exponential tail dichotomy 
of inter-contact time distribution typical of human mobility. Conversely, approach ii) is apt for 
asymptotic performance analysis, since pair- wise contact patterns are modeled directly instead 
of being derived from an underlying mobility model. On the other hand, contact patterns un- 
der approach ii) lack to refer to an underlying reference geometry, hence some aspects related 
to human mobility (e.g., the fact that when a node is in a certain region R of the movement 
domain, it can come into contact only with nodes that are located in R or in nearby locations) 
are not captured by contact models. 

In this section, we introduce, to the best of our knowledge, the first contact model which: 
1) is able to faithfully reproduce the power law-|-exponential tail dichotomy of inter-contact 
time typical of human mobility, and 2) can be used to derive asymptotic performance bounds 
for opportunistic networks. Although we admit that, as described above, contact models do 
not capture all aspects of human mobility, based on 1) we claim that the model defined herein 
is realistic in the sense that, differently from the models used to derive existing asymptotic 
performance bounds |2l[ll[5l[9l[l0l[l3l[2ll[22l[23l[25l[27j, itis able to reproduce the power 
law-l-exponential tail dichotomy of inter-contact time observed in human mobility traces. 

dichotomy. 



As the name suggests, to increase accuracy of the generated contact traces the Home-MEG 
model is based on the notion of home location introduced in |l3j ■ A pictorial representation of 
the Home-MEG model is reported in Figure [T| The state transition diagram reported in the 
figure models probability of occurrence of a wireless link between a specific pair of nodes u,v in 
the network at discrete time t. We can interpret the state transition diagram reported in Figure 
[T]as follows. The pair of nodes n, v can be in one of two states: Home state, corresponding to the 
situation in which both nodes u and v are at one of their home locations; and Non-Home state, 
corresponding to the complementary situation in which one of the nodes (or both) are not in a 
home location. We recall that, according to [H], any pair of nodes in an opportunistic network 
tend to repeatedly meet in a few locations (home locations), while meeting opportunities outside 
the home sites are occasional. To account for this, in the Home-MEG model the probability of 
establishing an instantaneous communication opportunity (i.e., a contact between u and v) at 
time t depends on the state of the pair: it is a with < a < 1 when the pair is in state H, 
and it is 7 with < 7 < a when the pair is in state NH. Finally, two further parameters of 
the model are the probability q of making a transition from Home to Non-Home state, and the 
probability p of making a transition from Non-Home to Home state. Summarizing, the model 
is fully characterized by four parameters: the state transition probabilities p and q, and the 
contact opportunity probabilities a and 7. 




Figure 1: The Home-MEG model. 

The goal of the Home-MEG model is modeling com,munication opportunities or contacts 
between u and v, where a communication opportunity is intended here as an instantaneous 
event during which an arbitrarily large number of messages can be exchanged. This notion 
of communication opportunity is consistent with a scenario in which the time step considered 
for analyzing information propagation within the network is relatively large, and the amount 
of data to be transferred between nodes is small. For instance, we can think of time steps 
elapsing a few minutes - this is consistent with all real- world data collected so far [14] ; if a 
communication opportunity occurs between u and v at time to, it is very reasonable then to 
assume that all messages in the respective buffers can be exchanged well before the next time 
step ti. 

Finally, notice that the Home-MEG model as defined here is similar to the Gilbert-Elliot 
model [U E] of communication channel, which has been extensively used in the communication 
engineering community to estimate probability of bit error bursts on a channel. 

3.2 Inter-contact time in the Home-MEG model 

The inter-contact time distribution in the Home-MEG model can be derived recursively as 
follows - see [IT], where the Gilbert-Elliot model is used. In the following, IC is the random 
variable corresponding to the number of time steps elapsing between two consecutive contact 
opportunities. 

Let ph and pnh represent the stationary probabilities of finding the pair of nodes in state 



Name 


Technology 


Duration 


jj devices 


jj contacts 


MIT Cell 


GSM 


16 months 


89 


1,891,024 


MITBT 


Bluetooth 


16 months 


89 


114,046 


Infocom06 


Bluetooth 


3 days 


41 


28,216 


Vehicular 


GPS 


6 months 


196 


9,588 


UCSD 


WiFi 


77 days 


275 


116,383 


Cambridge 


Bluetooth 


11.5 days 


36 


21,203 



Table 1: Main features of the considered real- world data traces. 

H and NH, respectively. It is easy to prove that 

V 



PH 



p + q 



and pNH = 1-PH 



The probability P{H\Contact) (respectively, P{NH\ Contact)) of finding the pair in state H 
(respectively, NH), conditioned on the event that a contact occurs can be computed applying 
Bayes' theorem: 



P{H\Contact) 



P{Contact\H) ■ P{H) 

P{Contact) 
a ■ p 



and 



p- a + q- J 

1 -q 



P(NH\Contact) = 

p- a + q -J 

The value of P{IC = k), for any k > 1, is recursively defined as follows: 



where 



P(IC = k) = P{H\Contact)PkH + P{NH\Contact)PkN 

PiH = {l-q)a + q7 , PiAf = (1 -p)7 + pa 
PiH = g(l - l)P(i-i)N + (1 - g)(l - a)P(i-i)H 

PiN = (1 -p)(l-7)^(i-l)Af +P(1 -a)P(i-i)H 



i = 2,..,k 



3.3 Validation 

Can we set the parameters of the Home-MEC model so that it fits real-world data traces? 
To answer this question, we have considered the six data traces used in [Hj, selecting a few 
representative points from the corresponding complementary cumulative density function (ccdf ) 
of the aggregate inter-contact time distribution. The main features of the considered traces are 
reported in Table [I] (see |14j for more details on the data traces) . We have then performed 
an iterative search on the four-dimensional parameter space of the Home-MEG model, aiming 
at minimizing the mean square error (in log scale) between the data trace of reference and 
the ccdf produced by the Home-MEG model. The resulting best fittings for the six traces are 
reported in Figure [2} As seen from the figure, the fitting is very good, and in particular the 
well-known power-law -|- exponential tail behavior of the aggregate inter-contact time ccdf is 
well reproduced by the Home-MEG model. 

The values of the four model parameters corresponding to the best fit for the six traces 
are reported in Table [2] In all cases, the time step in the Home-MEG model was set to 86.4 
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Figure 2: Best fit of the Home-MEG model (red curve) vs. six real- world data traces (blue 
curve) . 



Trace 


P 


9 


a 


7 


PH 


a/7 


p + q 


MIT Cell 


7.5 • 10-5 


3.3-10-3 


1.8- 10-1 


7.8 - 10-3 


0.022 


2,271 


3.4- 10-3 


MIT BT 


4.5- 10-5 


1.5-10-4 


1.2- 10-3 


8.6-10-'^ 


0.228 


1,368 


2.0- 10-4 


Infocom06 


3 • 10-3 


2.5-10-2 


7-10-2 


3 - 10-4 


0.107 


233 


2.8- 10-2 


Vehicular 


4.1 • 10-4 


7.9-10-3 


2.1- 10-2 


7.7 - 10-5 


0.049 


275 


8.3-10-3 


UCSD 


1.1- 10-4 


1.3-10-2 


10-10-2 


1 - 10-5 


0.008 


10,000 


1.4-10-2 


Cambridge 


2.5-10-4 


8.3-10-3 


4.7- 10-2 


4.6 - 10-4 


0.029 


102 


8.6-10-3 



Table 2: Best fit parameter values for the six data traces. 

seconds. The table reports also the stationary probability pn of finding the system in HOME 
state (both nodes in one of their home locations), the ratio a/7 of link occurrence probabilities 
in the HOME and Non-HOME states, and the sum p + q of the probabilities of making a 
state transition. As seen from the table, despite the different values of the four parameters for 
the different data traces, some general conclusions about these values can be drawn, namely: 
i) users spend most of the time in Non-HOME state - pn is consistently much smaller than 
Pnh = 1 — Ph', ii) a/7 S> 1, i.e., a ^> 7, which is in accordance with the intuition upon which 
the model has been defined; in) p+q <^ 1, i.e., states are persistent (otherwise stated, transition 
events occur seldom). 

Notice that property i) above is apparently in contradiction with recent characterizations 
of human mobility, according to which individuals tend to spend most of their time in a few 
locations [24j. However, we reiterate that, differently from most literature on human mobility 
models, the notion of home in the Home-MEG model refers to a pair of nodes, and is aimed at 
modeling the fact that the two nodes are simultaneously present in one of their meeting sites 
(home locations). Thus, the fact that a single node u spends most of the time in a few locations 
is not in contradiction with the fact that node u and, say, node v are only seldom co-located in 
one of their meeting sites. 

3.4 Evolving networks and Home-MEG Model 

The Home-MEG model can be used to describe the evolution of a network of mobile nodes that 
can have pairwise interactions over time. For the purpose of analysis, we now formally re-state 
the Home-MEG model as a collection of independent Markov chains. Consider a time evolving 
graph Q = ([n],£'j), where [n] is the (fixed) node set and Et is the time evolving edge set. 
Suppose every edge of the graph can be in four states {HC, HD, NC, ND}. In states HC and 
NC we say the edge exists (and the nodes are in Home and Non-Home states, respectively), 
while in states HD and ND we say the edge does not exist. The Markov chain over such a 
state space is given by the following transition matrix: 



/ 


HC 


HD 


NC 


ND \ 


HC 


(l-g)Q 


(l-g)(l-a) 


97 


g(i-7) 


HD 


(l-g)a 


(l-g)(l-Q) 


57 


g(l-7) 


NC 


pa 


pil-a) 


(1 -p)7 


(l-p)(l-7) 


i^ ND 


pa 


p{l-a) 


(l-p)7 


(l-p)(l-7) J 



(1) 



Such a chain is clearly ergodic (irreducible and aperiodic) and it is easy to verify that the 
following state distribution is stationary 



1 



HC 



HD 



NC 



ND 



vr 



P + q\ pa p{l — a) qj 9(1 — 7) / 
The Home-MEG model for a network of n nodes can now be formally defined as follows: 

Definition 1 (Home-MEG) A Home-MEG model % = T-L{n^p,q,a,^) on n nodes, where 
n G N^ and ^ p,q,a,'y ^ 1, is a sequence of random, graphs % = {Gt = {[n],Et) : t G N} 
such that the set of edges at time t is 



Et 



e e 



n 



: Xtie)e{HC,NC} 



where < {Xt[e)}t£n '■ e £ ( 2 ) \ is a family of independent Markov chains with transition matrix 
as in ^. 

Depending on the specific assumptions, the initial set of edges Eq, which is determined 
by the initial state of the Markov chains, can be arbitrary, or determined by the stationary 
distribution of the Markov chains (see below) . 

We stress again that the notion of Home state in the Home-MEG model is referred to a 
specific node pair, and that the n{n — l)/2 identical Markov chains used to model all possible 
links in the network are independent. The above implies that, for instance, the transitive 
property does not hold. In other words, the fact that the pairs of nodes {u, v} and {u, w} are 
in Home state does not imply that also pair {v, w} is in Home state. Thus, Home state in the 
Home-MEG model must not be intended as a single popular location within the network where 
all the nodes regularly meet, but rather as an abstract location corresponding to the (few) 
physical locations where a specific pair of nodes regularly meet. Notice that the independence 
assumption in Definition [T] implies that meeting locations are assumed to be different for each 
pair of nodes. Admittedly, this is an approximation of reality, where meeting locations are likely 
to be shared by several pairs of nodes. However, in the previous section we have shown that, 
despite this approximation, the Home-MEG model is able to accurately reproduce the aggregate 
inter-contact time distribution observed in several real-world traces. 

4 Flooding Time of Home-MEGs 

Let G be an evolving graph, the flooding process in G proceeds as follows: Start with one single 
informed node and, during the evolution of the graph, when a non-informed node gets in touch 
with an informed one, it collects the information. 

Let G = {Gt = {[n],Et) : t G N} be an evolving graph and let s G [n] be a node. 

Definition 2 (Flooding Process) The flooding process in G with source s is the sequence 
{If ■ t G N} of sets of nodes defined recursively as follows 



• Iq = {s} (at the beginning only the source node is informed j; 

• It-\-i = It UiVi+i(/t) (the neighbours of informed nodes become informed) 
where Nt+i{It) is the neighborhood of It in graph Gt+i, 

Nt^iilt) = {v £ [n] - It I {u,v} G Et+i for some u G It} 

Given an evolving graph Q and a source node s G [n], the completion time T{Q, s) of the flooding 
process is the first time step in which all nodes of the network are informed. The flooding time 
T[Q) is the maximum completion time over all possible choices of source s G [n]. 

4.1 Upper and lower bounds holding for any starting edge set 

In the following theorem we give upper and lower bounds on the flooding time of Home-MEGs, 
without any specific assumption on the initial set of edges Eq. Notice that this implies that the 
obtained bounds also holds over the transient phase of the Home-MEG. 

Theorem 3 Let % = T-L{n^p,q,a,^) be a Home-MEG with p + q ^ 1 and 7^0. Then, the 
flooding time Tn(H) ofH is 



Tn{n) 



O(T^i^)] -^-h-P. 



iog{i+nq) ) ^^ expectation 



where p = pa + {1 — p)j and q = {1 — q)a + 97 |j 

Proof. Recall that the random variable Xt{e) describes the state of edge e G ( 2 ) at time t, 
according to transition matrix M. Then, the probability edge e exists at time t is 

{(1 — q)a + 07 if a; G H 
pa + (1 — p)7 it X G JN 

where C = {HC, NC], H = {HC, HD}, and N = {NC, ND}. From the assumptions p + g ^ 1 
and 7 ^ Q, it follows that g = (1 — q)a + qj '^ pa + {1 — p)j = p. Hence, the probability an 
edge exists at time t is at least p and at most q, regardless of the state of the edge at time t — 1. 

Now consider two random evolving graphs G^ = {G^ : t G N} and Q'^ = {G\ : t G N} 
where Gf's and Gj 's are independent Erdos-Renyi random graphs G„^p and Gn^q respectiveljrl 
Since at every time step every edge exists in Ti with probability at least as large as in G^ and 
at most as large as in G^, it is intuitively true that the flooding time of H is at most as large 
as the flooding time of Q^ and at least as large as the flooding time of ^'. Since from [3] we 
know that the flooding time of Q^ is Tn{Q^) = O i jq n^-) ) w.h.p. and the flooding time of Q'^ 

is Tn{G'^) = ^ ( iog(°i^-i^q) ) ill expectation (see Theorems 3.7 and 5.3 in [3j), the thesis follows. 

A formal proof can be derived by using the coupling technique (e.g. see [E]). Roughly 
speaking, one can define random processes Ti, Q^ and G"^ on the same probability space, in a 
way that, naming Et{'H), Et{GP), and Et{Q'^) the (random) sets of existing edges at time t in 



■^In the following, we use w.h.p. to denote the fact that an event En, depending on a parameter n, holds with 
probability P {En) ^ 1 ^ ^) for every n sufficiently large. 

^ An Erdos-Renyi Gn,p random graph is a graph on n nodes where, for every pair of nodes u, v in the graph, 
undirected edge {u, v) exists with probability p, independently of other edges. 



processes 71, Q^ and Q'^ respectively, it holds that Et{Q^) C EtiTi) C Et{Q'^) for every t. In 
what follows we make the above argument rigorous. 

We need some technical machinery, namely the coupling technique (e.g. see (16j). Roughly 
speaking, we will define random processes 7i and Q on the same probability space, in a way 
that, if we name E^ and E^ the (random) sets of existing edges at time t in processes Ti and 
Q respectively, it will hold that E^ C E"^ for every t. That can be achieved, for example, as 
follows. 

exists does not exist 
I \ 1 e(g) 



e{U) 



HC NC HD ND 



Figure 3: Coupling for the edge e at time t for processes Ti and G 

Let {Ut{e) : t G N, e G ( 2 )} ^^ ^ family of independent random variables uniformly distributed 
over the interval [0, 1]; at time t we use random variable Ut{e) to update the state of edge e in 
both processes Ti and G as illustrated in Figure [3j 

1. We take two copies of interval [0, 1] and we name them e{T-L) and e{G); 

2. We partition the e(^)-interval in two sub-intervals: the leftmost one, of length p, labeled 
exists and the rightmost one, of length 1 — p labeled does not exist; 

3. We partition the e(^)-interval in four sub-intervals: the two on the left labeled HC and 
NC and the two on the right labeled HD and ND. The lengths of the sub- intervals 
are chosen according to the transition probabilities from the previous state of edge e in 
process 7i. For example, if edge e at time t — 1 was in state HC, then the lengths are the 
probabilities in the first row of the transition matrix (IT]) ; 

4. We set the state of edge e at time t in processes Ti and G according to the label of the 
sub-interval containing Ut{e) in intervals e{Ti) and e{G) respectively. 

By construction, the probability edge e exists in process G is p, and the probability edge e is in 
one of the four states in process Ti agrees with the transition matrix (IT|. Moreover, since the 
sum of the lengths of sub-intervals HC and NC is always at least as large as p, if the edge exists 
in process G then it exists in process Ti; hence, at every time step t we have that E^ C Ef^ . 
Thus, if the flooding process in G is completed within time t, then the flooding in Ti has to be 
completed within time i as well. D 

4.2 A tighter upper bound for the stationary case (and slow changes) 

In the previous subsection, we have provided upper and lower bounds on flooding time for the 
Home-MEG model. Notice that, while the upper bound stated in Theorem [3] is not based on 
any specific assumption about the initial edge set Eq, it is not tight in general: when p + q <^ 1 
and 7 < a - which is the case in practical situations, recall the discussion at the end of the 
previous section - then pa + (1 — p)^ ^ (1 — q)a + qj and the lower bound on the probability 
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of the existence of an edge we used in Theorem [3] is too lose. Indeed, consider the following 
example: assume p = q = a = 1/n and 7 = l/n^. Notice that p = q means that, if £"0 is 
random according to the stationary distribution, edges are in Home location approximately half 
of the time. In this case we have 

p = pa + {I - p)j = 7 + p{a - 7) ss ^j 

and Theorem [3] gives an upper bound 0{n log n). On the other hand, the probability of existence 
of an edge, if it was in Home location at previous time step, is 

(1 - q)a + q-y = a- q(a - 7) w - 

n 

Hence, based on [3], one would expect flooding time to be something like C'(logn) in this case. 
In other words, in the proof of Theorem [3] we lower bounded the probability of existence of an 
edge in a given time step as the edge always was in Non-Home state at previous time step, and 
this is clearly a loose bound when the "stationary" expected fraction of edges in Home state 
is significant and 7 <C a. In order to obtain a tighter upper bound on the flooding time, we 
instead fully exploit the properties of the stationary configuration and get the bound stated in 
the theorem below. For convenience sake, in what follows we set 

j^ ^ 4(p + q) 
pa 

Observe that, according to (pi), we have 4/A = 7r{HC). 

Theorem 4 LefH = 'H{n,p,q,a,"f) be a Home-MEG and assume that \—~\ ^ min{l/a, l/(4g)}. 
Then, if the initial edge set Eq is random according to the stationary distribution vr, the flooding 
time Tnil-L) is w.h.p. 

logn 



^"^^^-^Vlog(l + n/A) 

The upper bound in Theorem E^ is not always applicable (see the condition on A and on Eq) 
and, in case of frequent changes between the Home and the Non-Home states (i.e. for large 
values of p and q), it can be much weaker than the upper bound of Theorernpl For instance, 
for 7 ~ 1/n, a ~ 1/n^'^, p ~ 1/n^'^, and q ~ 1/n^'^, the bound of Theorem Wlis O(n^'^logn) 
while the upper bound obtained by Theorem^ is ©(logn). 

On the other hand, the upper bound in Theorem [4] can be much tighter than that of Theo- 
rem [3] in the realistic scenarios described in Section [3] Indeed, we first observe that the assump- 
tion [— ] ^ min{l/a, l/(4g)} is in accordance with the "realistic" assumptions on p, g, a,7 
derived in Section Isl When p + q <C 1, the assumption is satisfied for any p ^ q/n. More 
importantly, the above theorem has the following interesting consequence. 

Corollary 5 Consider an opportunistic network with n nodes, and assume pairwise link occur- 
rences are modeled with the Home-MEG model, with parameters set as follows: 

n' 1 1 1 

n n^ n^^"^ n 

for some arbitrary constant < e < 1. Then, flooding in the network is w.h.p. completed in 
©(logn) time. 
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We observe that all conditions i), ii), and in) stated in Subsection 3.3 are met when the 
parameters of the Home-MEG model are set as in the statement of Corollary [5J Also, when 
the parameters of the Home-MEG model are set as above, the stationary expected edge density 
(i.e. the number of neighbors of a node) is 1, i.e., the network is well below the connectivity 
threshold of 0{logn) (this is consistent with the opportunistic network setting). With similar 
parameter settings, Theorem Is] gives a bound on flooding time of 0{nlogn). 

Our bound on flooding time for sparse Home-MEG leads to conclusions that largely de- 
part from those arising from simple geometry mobility models. Indeed, in [H [Sj [21} I22| . tight 
bounds on flooding time are derived based on the random walk model in which the mobility 
and transmission range of nodes can be fixed in a certain range. Informally speaking, n nodes 
move in a square region of side \/n, nodes transmission range is R, with c < R < -y/n, and 
the nodes mobility radius in one time step is 0{R). Under these conditions, the flooding time 

is shown to be ( ^ I . By setting the transmission range in such a way that the expected 

node degree is 0(1) (i.e. the same as in Corollary [5]) the results of [131 El 122] yield a Q{^/n) 
bound on flooding time, which is exponentially larger than the bound obtained (O(logn) vs 
0{^/n)) with the Home-MEG model (Corollary^. It is important to observe that both ex- 
isting bounds based on geometric mobility models and the bounds derived herein display both 
realistic as well as unrealistic features of the induced contact patterns between nodes, as al- 
ready discussed in Section [3| In particular, geometric mobility models faithfully capture the 
fact that occurrence/disappearance of a link between a pair of nodes is governed by physical 
notions such as distance between nodes, transmission range, etc. On the other hand, the specific 
mobility models used in the existing flooding time analyses [U [5l [2H [22] are known to be unre- 
alistic mainly because of two reasons: (i) the generated contact traces do not display the power 
law-l-exponential tail dichotomy of inter-contact times, and (ii) human and veichular mobility 
largely differs from basic random walks. The Home-MEG model used in our analysis tackles 
the issue of the inter-contact time dichotomy, but it is an abstract contact model in which 
occurrence of links between nodes is not related with physical notions such as distance between 
nodes, transmission range, etc., thus it is not realistic in that respect. Despite this unrealistic 
aspect, the experimental results presented in Section |3] indicate that the Home-MEG model is 
a reasonable candidate to estimate flooding time in opportunistic networks, while theoretical 
analysis of this model suggests that flooding time in opportunistic networks might be much 
shorter than predicted by existing polynomial bounds obtained from simple geometric mobility 
models. 

We want to comment on the fact that the Home-MEG model parameters are assumed to 
be dependent on the number n of nodes in the network, and in particular decreasing with 
n. This assumption appears reasonable, since a single Markov chain in Home-MEG models 
the existence/non-existence of the link between a specific pair of nodes in the network. For 
instance, this assumption is in accordance with a scenario in which the number of nodes (each 
with a fixed transmission range) in the network grows, while the density of nodes per unit area 
remains unchanged (constant density networks). Finally, we emphasize that, differently from 
previous related work [13] [21] , our theoretical analysis provides high concentration results on 
flooding time (the results hold with high probability rather than asymptotically almost surely^ 
and the constants hidden by the asymptotic notation are rather small, say not larger than 10. 
Both these properties are clearly crucial when the goal is to apply theoretical results to realistic 
scenarios. 



*An event _E„ holds a.a.s. if lini„_).oo P {En) = 1. 
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4.2.1 Proof of Theorem H 

We first provide an informal description of the main arguments. 

Let T-L = T-l{n,p,q,a,j) be a Home-MEG and, for any edge e and positive integer i, define tlie 
"connection" probability -P^(e) = P (3i G {0, ■ ■ ■ ,i} \ e G Ei). By assuming £"0 to be random 
with the stationary distribution, we prove that 



Pe{e) > -^^e for any £ ^ minil/a, l(4g)} 
v + a 



p + q 

The above bound says that, starting from the stationary distribution, the connection probability 
grows linearly in i, for a while. A natural approach would be that of exploiting this bound to 
derive some good node-expansion property of the time evolving graph and, then, applying this 
expansion to the growing subset of informed nodes along the flooding process. This approach 
implies that, starting from the stationary distribution, at any time of the flooding process, we 
are interested in the edges that connect informed to uninformed nodes. This in turn entails that 
the state of the same edge might be observed multiple times. Unfortunately, when the state of 
an edge is observed at a given time, then it cannot be considered "stationary" anymore. We 
overcome this technical difficulty by considering a "subprocess" of the flooding which is orga- 
nized in time-periods: at the beginning of every time-period, only "fresh" edges are considered, 
i.e., edges that have been never observed before. More precisely, let Cj be the set of nodes that 
have been informed during the i-th time-period, then only the edges in the cut (Cj, F — U*~QCi) 
will be considered in time period t. Finally, by setting the length of the time-periods in a 
suitable way, we get the bound. 

In the remainder of this section we provide the formal proof. We will make use of the following 
well-known inequalities: For any positive real < x < 1, it holds that 

1 - X ^ e"''' ^ 1 - x/2 (3) 

We first provide a general upper bound on the edge-disconnection time for any finite Markov 
chain governing the edge evolution. Let A be any subset of the states of the Markov chain and 
let D be the subset of states implying edge disconnection. Notice that in the Home-MEG model, 
D = {HD,ND}. Assume the starting edge configuration Eq is random with the stationary 
distribution vr and, for any state subset X of the Markov chain, define 

£f = "e eX,yt = 0,...,e" and vr/'^ = P {Sf^ \ £^) 

Lemma 6 Let A and D be such that: 

• Hyp. I. VseAnD, P(s^^nD)^A; 

• Hyp II. Vs G A, P{s^A)^ 6. 
Then, for any l^ 1 it holds that 

Proof. It holds that 

4''' = p(ff|£:o^Af/)p(f/|^o^)+p(ff|fo^A^f/)p(^£:/|fo^) 

^ P {eF I £^ A Ef) P {8f\8^) + 1 • (1 - P {8t\m) 

13 



4'" ^ 1 - p (^/l^o^) (1 - p {^F\£t)) (5) 

Let us first bound P (<?/) • 

P(£-/) = ^{st\£^^8t,)^{8t^\£^)+Y{Et\S^^^8t,)^{-.8t^£^) 

= ^{8t\8t,)^{8t,\£^) 
(by Hyp. II) ^ 5^{8t^\8^) 

So, we get 

P {£t) ^ ^' (6) 

As for the term P {£f\£f), it holds 

p ..D|.A^ _ P(gfAg/) _ P (gf ^-^) 

P l^^ 1^^ J - p (£:A) - p (£:A) (7) 

while, for P (f^'^"^), we apply Hyp. I and get 

P (^i^^^) = P [£P''''\£^''^) P (fo"""^) + P {£f''^\^£^^^) P (-£:o^"^) 

= P (<Si^n^|<Sf7^) • P (■Sf'^^l-So^'^^) P (£:o^^^) 

(by Hyp I) ^ AP(£:f7^|fo^'^^)vr(AnZ)) 

We thus get 

P (ff ^^) ^ A^7r(A n D) (8) 

As for P (<?/), 

P(f/) = P(^/|fo^)P(fo^)+0 
From Eq. [6] ^ ^^(yl) 

By combining the above bound with Ineq. [8j Ineq. [7] becomes 

Finally, by replacing the above bound and that in [6] in Ineq[5j we obtain 

n 

The above lemma can be applied to a Home-MEG % = T-L{n,p,q,a,j) with A = H = 
{HC, HD} and D as above. Notice that 

tt{U) = ^— and 7r(HnD) = ^^^~"^ (9) 

A = (l-g)(l-a) and S = {1 - q) (10) 

By replacing the above formulas in Ineq. Q, we get the following 
Corollary 7 Let % = 'H{n,p,q,a,'y) be a Home-MEG. Then, for any i ^ 1 it holds 

vrf -^ ^ 1 - (1 - g)^(l - (1 - a)^) (11) 
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The Flooding Process. In order to manage stochastic dependence, we consider a somewhat 

sub-process of the actual flooding by (only) looking at restricted subsets of informed nodes and 

edges. This subprocess works in consecutive time periods A,-, r = 0, 1, . . .: each time period 

A,- is formed by the time steps tr,tr + 1, ■ ■ ■ ,tr + Lr — 1. The period lengths Lr are defined as 

follows. Define 

K = 2 max < 1, — - > , to = 0, tr = tr-i + L^ 
I 5A J 



where L 



'r-l 
L DA J 

4A log n 



and, for r ^ 2 



Lr = [ ^ if 2K-2logn^A 

I I" n 1 otherwise 

Then, for any time period A,-, define the following node subsets 

Co = {s} 

C'^ = {v ^V\ Ug^^Ci |3f G A^ and w G C^_i s.t. {w,v) G Et-i} 

( C; if |C;| ^ 2K^-Hogn or |C;| ^ n/16 
[ C'r otherwise 

where C'^ is any subset of C'^ of size 2K'^~^ logn. 

Observe that subsets Cr are mutually disjoint and the edges in the cut E{Cr~i,Cr) are not 
considered by the (sub-)fiooding process before time-period r, so in that period they are random 
with the stationary distribution. Moreover, Cr is a subset of the set of informed nodes at the 
end of time period r. 

Phase 1. We now analyze the size of Ci. This time period of the flooding process will be called 

Bootstrap. 

Observe that, by definition of vr^ ' , Ineq. 11 holds under the condition e ^ H at time t = 0. 

So, 

P {£f) = P {£P\£,^) P {£,") + P (ff h^o^) P Ko^) 



H,D 



(by Ineq. 11) ^ TT^'^^TriH) + {1 - tt{H)) 



By applying Ineq. ITT] and Eq. [91 we get 



P{£f) ^ l--^(l-g)^(l-(l-a)^) (13) 



We now assume that £ ^ min{l/a, l(4g)}. Then, we apply Ineq. p] to Ineq. 13 and get the 
following 

Lemma 8 Let £ ^ min{l/a, l/(4g)}, then it holds that 

P {£f) ^ 1 - { (14) 

By using the above lemma, we can prove an upper bound on the bootstrap time. 



15 



Lemma 9 Let % = T-L{n,p,q,a,'y) be a Home-MEG. Then it holds that 

P(|Ci|^21ogn) ^ ^ 



n' 



Thus, the time of Phase 1 is w.h.p. upper bounded by Li 



4A log n 
n 



Proof. Observe that £ = Li satisfies the hypothesis of Lemma [8J We thus apply this lemma for 
£ = Li and get that the P {S^J ^ 1 - ^^. 

Let us now consider every possible edge of the source node and look at its behaviour for the 
first Li time steps. Since edges are mutually independent, we can apply Chernoff's bound [19] 



and, thanks to Ineq. 14, we get that, w.h.p., the number of nodes that will be informed by the 
source node within the first Li steps is at least 21ogn. D 

Phase 2. We can now assume there are at least 21ogn informed nodes and we analyze the 
fiooding process till an arbitrary constant fraction of all the n nodes are informed. This is the 
second phase. 
For every r ^ 1, define the events 

Fr = " \Cr\ ^ § " Xr = " 1^1 ^ 2K^-^logn " 

Ft = \ll=oFt Xr = At=0^t 

where Xq is the event " |Co| ^ " (which happens with probability 1). 
Lemma 10 Assume that [— ] ^ min{l/a, l/(4g)}. For every t ^ 1, it holds that 

P ((F, V Xr) A ^Fr-i A l,_i) ^ ("l - -^"j P (-F,_i A l.-^ 

Proof. Define the event 

B= "El^^Kg" 

We show that the following implication holds 

^BAXr-l => Fr-l (15) 

If T = 1, it trivially holds since -^B is false. As for r ^ 2, let us assume by contradiction that 
event -F-r- 1 does not hold. Since Xr-i holds, according to the definition of Cr, it must be the 
case |Cr| = XICt— i| with K ^ 2. This implies that 



r-l 

n 



\Ct-i\ ^ 2_^ 1^' 



T-2 



t=0 

It follows that 



Since ^B holds, we have that 



2 

t=0 4=0 



I ^ ii ^ 2^^' ' 16 

i=0 



16 



and Fr-i holds, a contradiction. 



It is immediate to see that ( 15 ) imphes the following 

^Fr-l AXr-1 = 

It derives that 



and 



Moreover, if 

P f-Fr-i A Xr-i AB] =0 

then the thesis trivially holds. Thus, suppose that 

P (^Fr^l A Xr-l /\B] >0 



B 



P (F^ V Xr) A -^Fr-i A Xr-l = P (F^ V X^) A -^Fr-i A X^_i A B 



P -^Fr-l A Xr-l = P ^Fr-l A Xr-l A B 



Then, define event W = -^Fr-i A Xr-i A -B, then the thesis can be reformulated as follows 

1 



V{Fry Xr \W)^1 

We first consider the case r = 1. In this case, we get 



n^ 



P(Fi VXi I W) = 

since P(VF) = 1 ^ P (Fi V Xi) ^ P (|C7i| ^ 21ogn) 

from Lemma [9] ^ 1 7^ 



n^ 



Consider now a time period r ^ 2 and, for any node w G F \ Uq Cj, define r.v. Yj, 

veC'r. 

Since \^~\ ^ min{l/a, l/(4q)}, Lt- satisfies the hypothesis of Lemma Sl It holds that 



1 iff 



E[|c;i I T^] = E 



E^^ 



w 



E 






A 



ic;_ir 



w 



> E 



7n / / Lr\C'^_i\ 

1 — exp 



A 



Two cases may arise. 

Case A: 2K'^~^logn ^ A. In this case Lr = I and from Ineq. 16 



W 



we get 



^[\C'r\\W]^ 'iil-e-^^) > y(i-^)^ 



7n 
16 



(16) 



17 



Case B: 2K^"^logn < A. From Eq. 12, it holds 



Lr 



5A 



n 



From Ineq. 16 we get 

E [|C;| I VF] ^ E 



-— 1 — exp — -' " ' 



A 



W 



Observe that event W imphes Xr~i which in turn imphes |C^_i| ^ 27^"^ logn. It holds that 



, 7n f ( r5Al2K--2log 

E[|C;| liy] ^— 1-exp -^1 ^ 



n 



A / / 

We again distinguish two subcases. If A ^ ^/5, from the above inequality, we get 

E[|c;iiiy]>f(i-e.p(-M:^ 

Since -i-F implies '^^^ ^ 1, we can apply (3) and get 

L' ^' ' J 8 2n 16 32 ^ 

where the last inequality follows from the fact that, in this subcase, K = 2. 



(17) 



As for the second subcase A < n/5, since [(5A)/n] = 1, Ineq. 17 becomes 

E[|C^;i|H^]^y(l-exp(- 



A 



Since again -iF implies ^ °^" ^ 1, we can apply (3) and get 

In both cases, we can apply Chernoff's bound on the sum C^ of independent r.v. and then, 
thanks to the definition of Cr, get the thesis. D 

Lemma 11 Assume that \—~\ ^ min{l/a, l/(4g)}. Then, for every t ^ 1, it holds that 



(Pr V Xr 



> 1 



n^ 



Proof. For the sake of convenience, let A^ = i<V V Xr- The proof is by induction on r. For 

T = l, 

Fi = Fi and Xi = Xi 



Moreover, 



P(^FoAXo]=l 



Thus, from Lemma 10 we have 



P (yli) ^ 1 



n^ 
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Now, assume that r ^ 2. It holds that 

P {Ar) = P(FrV Fr-1 V {X^ A X 

^ P ((F, V X,) A ^,_i) 

= pfF^VX, \Ar-l)P{Ar-l 



^ I 1 - ^^ ) P ( F, V X, I Ar-l 



where the last inequality is due to the induction hypothesis. We have that 



(18) 



P F^VX^|yl^_i = P F^VX^ 



A,_iAF,_i P F,_i 1^,-1 + 



+P F, V Xr 



Ar-l A ^Fr-l P -F,_l I Ar-l (19) 



Since Ar-i A F^-i =^ F^ =^ F^ V X^, it holds that 



P F, V X^ 



Ar-l A F, 



T-l 



(20) 



Since Ar-l A ^Fr_i =Xr_iA^Fr_i and (F^ VX^) AX^-i A^F^-i = (F^ VX^) AX^-i A^F^-i, 
it holds that 



PiFrVXr 



Ar-l A -^Fr-1 = P Fr V Xr 



^Fr-1 A Xr_i ^ 1 



1 



n^ 



(21) 



where the last inequality follows from Lemma 10, By combining Ineq.s 19 20, and 21 , we obtain 

1 



P ( Fr V Xr I Ar-l ) ^ P ( Fr_l I Ar-l ) + ( 1 " ^ ) P ( -F^-l | Ar-l ] > 1 



n^ 



From Ineq. 18 we obtain 



P (Ar) ^ 1 



T- 1 



n^ 



n^ 



> 1 



n^ 



D 



Lemma 12 LetH = 7i(n,p,q,a,'j) be a Home-MEG and assume that \—~\ ^ min{l/a, l/(4g)}. 
Then 



P 3 T^ 



logn 
logiT 



\Cr\^^] ^1 ^^^'^ 



16 



n^ 



Proof. We observe that for any r ^ "^i" k"^" "1"-^' event Xr does not hold. Then, for r = Ll^^J > 



Lemma 11 implies 



P Fr = P Fr V Xr ^ 1 



^ 1 



logn 



n^ 



n' 



D 



Thanks to the above lemma, we can now bound the time of Phase 2. 



Lemma 13 LefH = V.{n,p,q,a,^) be a Home-MEG and assume that \^] ^ min{l/a, l/(4g)}. 
Then the time of Phase 2 is w.h.p. O i ^^^ \^ 
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Phase 3. During Phase 2, we have that, w.h.p., a time period f exists such that 

\Cr\ ^ n/16 (22) 

Now, let's look at the nodes in Cf and at the edges from such nodes to a node v €V\ Uq^ Cj. 
We observe such "new" edges for (. consecutive time steps after the end of time period f. Define 
binary r.v. 1^ = 1 iff some of such edges is on in at least one of these I time steps. We make 



use of Lemma pi and Ineq. 22 to derive 



i \ ''1^^ ( in 

P(K„ = 0) ^ (1-1 ^exp^ 



A ; "^ V 16A 

By choosing i ^ — ^r^' ^^"^ ^-PPlyiiig the union bound over all 1^'s, we have that, w.h.p., all 
nodes will be connected to some node in Cf within the next i time steps. We thus have proved 
that 

Lemma 14 Let T-L = 'H{n,p,q,a,^) be a Home-MEG and assume [— ] ^ min{l/a, l/(4g)}. 
Then, w.h.p. the time of Phase 3 is upper bounded by 

,'Adogn 
n 



Finally, Theorem [4] follows from the time bounds of the three phases, i.e. Lemmas [9, 13, and 
M 

5 Conclusions 

In this paper, we have introduced a simple opportunistic link model which is able to faithfully 
reproduce the power law+exponential tail dichotomy of inter-contact time typical of human 
mobility, and used this model to derive asymptotic bounds on flooding time in opportunistic 
networks. A major finding of our study is that flooding time in opportunistic networks might 
be much faster than what predicted by existing bounds based on simple geometric mobility 
models which are known not to be able to reproduce the above described dichotomy. 

Our bounds on flooding time - and, more specifically, the proof of Theorem |4j which breaks 
down the flooding process in phases of different duration - could be very useful in estimating 
the propagation of proximity malware (i.e., malware propagated through Bluetooth/WiFi in- 
terfaces) in mobile networks [26j. For instance, it has been recently observed in [28J that MMS 
malware (i.e., malware propagated through sending MMSs) in cellular networks displays a slow 
start followed by an exponential propagation. Our results can be used to investigate whether a 
similar propagation pattern is displayed also in proximity malware. 

A major avenue for further improving our results is introducing dependency between the 
Markov chains modeling the possible links in the network, in order to account for physical 
mobility constraints and/or social relationships between nodes. Deriving bounds on flooding 
time with dependent pair-wise links is a challenging task, which we are currently undertaking. 
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